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ABSTRACT 

In this work I discuss the necessary steps for deriving photometric redshifts for lumi- 
nous red galaxies (LRGs) and galaxy clusters through simple empirical methods. The 
data used is from the Sloan Digital Sky Survey (SDSS). I show that with three bands 
only (gri) it is possible to achieve results as accurate as the ones obtained by other 
techniques, generally based on more filters. In particular, the use of the (g — i) color 
helps improving the final redshifts (especially for clusters) , as this color monotonically 
increases up to z ~ 0.8. For the LRGs I generate a catalog of ~ 1.5 million objects at 
z < 0.70. The accuracy of this catalog is a — 0.027 for z ^ 0.55 and a = 0.049 for 
0.55 < z ^ 0.70. The photometric rcdshift technique employed for clusters is indepen- 
dent of a cluster selection algorithm. Thus, it can be applied to systems selected by any 
method or wavelength, as long as the proper optical photometry is available. When 
comparing the redshift listed in literature to the photometric estimate, the accuracy 
achieved for clusters is a = 0.024 for z ^ 0.30 and a = 0.037 for 030 < z ^ 0.55. How- 
ever, when considering the spectroscopic redshift as the mean value of SDSS galaxies 
on each cluster region, the accuracy is at the same level as found by other authors: 
cr = 0.011 for z < 0.30 and a = 0.016 for 030 < z 0.55. The photometric redshift 
relation derived here is applied to thousands of cluster candidates selected elsewhere. 
I have also used galaxy photometric redshifts available in SDSS to identify groups 
in redshift space and then compare the redshift peak of the nearest group to each 
cluster redshift. This procedure provides an alternative approach for cluster selection, 
especially at high redshifts, as the cluster red sequence may be poorly defined. 
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1 INTRODUCTION 



Recent galaxy redshift su rveys (2dFGRS, IColless et al 1l200ll : 
SDSS, lYork et al l [2000) have provided the astronomical 
community a unique view of the local universe (z ~ 0.1). 
Such surveys are based on spectrographs that simultane- 
ously observe hundreds of objects. Although the improve- 
ment respective to a decade ago is enormous, larger and 
mainly deeper general spectroscopic surveys are not yet pos- 
sible with current instrumentation. Note that surveys tar- 
geting spec ific populations, such as Luminous Red Galax- 



ies fSDSS. lEisenstein et al.l 



(Wiggle-z, Idazebrook et all 



2001) or star- forming galaxies 



20071 ). can sample much larger 



volumes. For the mean time, photometric redshifts provide 
a valuable alternative to probe faint sources within large 
areas. 
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Photometric redshift techniques are essentially a mech- 
anism to convert photometric properties of galaxies (such 
as colors) into redshift and physical properties (e.g., lumi- 
nosity and type). Thus, with the proper choice of passbands 
and the use of an accurate photometric redshift algorithm, 
it is possible to map the distant universe in three dimen- 
sions. These surveys represent a powerful tool for studying 
the statistical properties of galaxies and their evolution. 

There are several photometric redshift estimators devel- 
oped to date. These can be generally classified either as em- 
pirical or template-based methods. In the first case a direct 
relation is obtained through the comparison of the photo- 
metric properties (colors) and spectroscopic redshifts. Such 
empirical relations can be derived, for instance, through 
olynomial fitting jlConnollv et aLl H-995) or neural networks 
Collister fc Lahavl l20o1 T The template-based algorithms 
rely on the availability of a set of galaxy templates. These 
should accurately represent the distribution of galaxy SEDs 
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and their evolution with look back time l|Csabai et al.|[20o3 ) . 
Hybrid photometric redshift techniques have also been pro- 
posed in the last few years. They combine the advantages 
of empirical and template fitting methods by iteratively im- 
proving the concordance between photometric data and the 
spectral energy distributions. In other words, the template 
spectra is reconstructed to best matc h the observed photo- 
metric measureme nts of each galaxy l|Budavari et al.l l2000l : 
ICsabai et aDl2003h . 

Luminous red galaxies are marked by uniform spectral 
energy distributions, characterized by a strong break at 4000 
A due to the accumulation of a number of metal lines. The 
shift of this feature through different filters is strong cor- 
related with redshift. These galaxies are also known to be 
some of the most luminous objects in the universe and are 
preferentially found at high density environments, rendering 
these objects an interesting tool for selecting and studying 
clusters. All that said, it is clear that LRGs comprise an 
optimal population to derive accurate photometric redshifts 
to very large distances. 

This paper describes the construction of a large photo- 
metric redshift catalog of LRGs at z < 0.70. This catalog 
is based on simple polynomial fitting of the relations be- 
tween galaxy colors and spectroscopic redshifts. I explore 
the use of different colors from SDSS, showing that with 
three bands only it is possible to achieve results compara- 
ble to more elaborated e mpirical techniques, such as ANNz 
ilCollister fc Lahavll2004). kd-trees or the nearest-neighbour 
method ( Csabai et al.l 20031 ). In addition, I employ similar 
relations to derive photometric redshifts of galaxy clusters. 

When estimating redshifts of clusters the main draw- 
back is the need to apply a background correction when 
selecting probable cluster galaxies seen in two dimensions. 
I discuss different possibilities when minimizing the back- 
ground effects, showing that the most precise results can be 
achieved when selecting the reddest galaxies (the selection 
is based on the u — r color) . Photometric redshift estimates 
of clusters rely on precise values of their median color. El- 
liptical or SO galaxies comprise the main population in the 
central regions of galaxy clusters. Thus, one would like to 
use these galaxy types when estimating the typical colors 
of clusters. As it is shown in §4.2, at low redshifts a simple 
statistical background correction is enough to minimize the 
influence of galaxies that do not belong to the clusters, and 
to accurately estimate cluster colors. However, at higher red- 
shifts this simple correction leads to an underestimation of 
cluster colors compared to the expected values for ellipticals. 
That is also due to the inc rease with redshift in the fraction 
of blue galaxies in clusters l|Butcher fc Oemlerlll984h . To cir- 
cumvent this problem, the use of the u — r color plays a key 
role to help the selection of early type systems and thus 
reduce the scatter of the observed colors of clusters. More 
details are found in §4.2. 

This paper is divided as follows. In the next section I 
describe the SDSS survey, which is used as the basis for ob- 
taining the empirical relations and evaluate the results. In 
section 3 I describe the selection of LRGs and the photomet- 
ric redshift technique employed for these objects. The same 
is done for clusters in §4, where I also make considerations 
about redshift accuracy. I also use galaxy photometric red- 
shifts from SDSS for the identification of groups in redshift 
space. I summarize the results in §5. Through this work I as- 
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The photometric and spectroscopic data for this paper were 
taken from the fi fth release of the Sloan Digital Sky Survey 
(|York et aUl200Ch . The SDSS consists of an imaging survey 
of 7r steradians of the northern sky in five optical passbands 
(ugriz), from 3,500-8,900 A. This will provide photometry 
for of order 5 x 10 7 galaxies. Spectroscopic will provide red- 
shifts and spectra for ~ 10 6 of these. The survey is carried 
out using a 2.5 m telescope, an imaging mosaic camera with 
30 CCDs, two fiber-fed spectrographs and a 0.5 m telescope 
for the photometric calibration. The imaging survey is taken 
in drift-scan mode and the data are processed with a pho- 
tometric pipeline (PHOTO) specially written for the SDSS 
data. 

Targets for spectroscopy are selected by the targeting 
pipeline from the imaging. Spectrosco pic fibers are assigne d 
to the targets by a tiling algorithm l|Blanton et al.| [2003). 
The minimum distance of 55 arcsec between the fibers leads 
to a loss of ~ 6% of galaxies, which is the main source of 
incompleteness. 

The spectroscopic survey is originally divided in three 
samples. The 'Main', flux-limited sample, has a median red- 
shift of 0.104 and a limiting magnitude of r pe tro ~ 17.77 
l|Strauss et alj|2002l ). As this limit is much brighter than 
that for the imaging, the redshift completeness is nearly 
100%. The second sample is the luminous red galaxy sample 
LRG), which is appro ximately volume-limited to z tn 0.38 
Eisenstein et al.ll200lf ). extending to z ~ 0.55. Finally, the 
quasar sample is defined by objects with colors distinct from 
those of ordinary stars. The completeness of this sample de- 
pends somewhat on redshift. In particular, the completeness 
is low for 2.4 < z < 2.9, where the quasar and stellar loci 
cross; it is similarly low at redshifts around 3.5 and 4.5. 

In addition to thes e data, the 2dF-SD SS LRG and 
Quasar Survey (2SLAQ; ICannon et al.l [20061 ) has been re- 
cently completed. This survey exploits the high-quality 
SDSS imaging combined with the extraordinary spectro- 
scopic capabilities of the Two-degree Field (2dF) instru- 
ment on the 3.9-m Anglo- Australian Telescope. It then re- 
sults in a spectroscopic redshift catalog of ~13,000 LRGs at 
0.4 < z < 0.7. These data can be found in the data release 
five (DR5) of SDSS or directly from the 2SLAQ websit<S In 
this paper I use photometric data from SDSS and spectro- 
scopic redshifts from SDSS and 2SLAQ surveys (available 
within SDSS). 

All the data selected from SDSS is from the DR5. I 
have selected only objects from the "Galaxy" view (so that 
only PRIMARY objects are allowed) in order to avoid dupli- 
cate observations. Standard flags for clean photometry are 
also enforced. When selecting spectra and imaging, a joined 
query of the Galaxy and SpecObj (objects with clean spec- 
tra) views is performed. All the magnitudes retrieved from 
SDSS are de-reddened (corrected for extinction) model mag- 
nitudes. 



http:/ /www. 2slaq. info/. 
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3 SELECTION OF LUMINOUS RED 
GALAXIES 

The selection criteria adopted for constructing a photomet- 
ric sample of LRGs i s analo gous to the description given in 
iPadmanabhan et al. (2005), which is aimed at selecting a 
uniform sample of LRGs at 0.2 < z < 0.7. Two different 
criteria are applied for selecting a low redshift sample (Cut 
I, z < 0.4) and a high redshift sample (Cut II, z > 0.4). 
Initially, two color tracks are defined: 



c± = (r-i)-(g-r)/A- 0.18 , (1) 

dx = (r - i) - - r)/8 wr-i. (2) 

Then, the following color cuts are applied 

Cut I : | c ± |< 0.2 ; (3) 

Cut II : d x > 0.55 , (4) 

g-r> 1.4 , (5) 



The final cut, g ~ r > 1.4, is effective on isolating the 
sample from the stellar locus. In addition to these selec- 
tion criteria, all galaxies with g — r > 3 and r — i > 1.5 
are eliminated. These last constraints are helpful on remov- 
ing stars with unusual colors, w ithout discarding real galax- 
ies jPadmanab han et al.ll2005l ). However, it is important to 
keep in mind that a 5% st ellar contamina ti on m ay still be 
present, as pointed out by ICollister et all (|2007t ) (see sec- 
tion 4 of their paper), who applied similar selection criteria. 
Nonetheless, their criteria leads to the selection of more ob- 
jects than the one adopted here. 

These color cuts are st ill not enough to sele ct LRGs 
from SDSS (see discussion in lEisenstein et al1l200ll ). There- 
fore, additional cuts in magnitude are applied. First, a color 
track which is approximately parallel to the low-redshift lo- 
cus is defined for Cut I 

C|| = 0.7(3 ~r) + 1.2(r - i - 0.18) , (6) 

Then, the following cuts are implemented 

Cut I : rpetro < 13.6 + C| |/0.3 , 

r Petro < 19.7 ; (7) 

Cut II : i < 18.3 + 2d ± , 

i < 20 . (8) 

rpetro is used for consistency with the original SDSS LRG 
target selection. Except for the numerical values of the mag- 
nitude cuts in equation 7, Cut I is identical to the SDSS LRG 
Cut I. The numerical values for Cut II are chosen to derive 
a population consistent with the first cut. At the redshift 
range sampled by Cut II the 4000 A break is moving through 
the r band. As a consequence, the r band K— corrections are 
very sensitive to redshift. Thus, using the i band for Cut II 
leads to a more robust selection. 

When applying these criteria to select LRGs from the 
DR5 of SDSS a total of 578,160 galaxies are selected using 
Cut I and 896,988 through Cut II. The combined sample, 
after excluding overlapping galaxies, adds to 1,459,536 lumi- 
nous red galaxies. This sample is from now on called the pho- 
tometric sample. Note that the high redshift sample has ap- 
proximately 74% of the MegaZ-LRG catalog (|Collister et al.l 



120071) . This last catalog was selected from the data re- 
lease 4 of SDSS (D R4). If the same criteria adopted by 
ICollister et all (|2007l ) is applied to DR5 the number of LRGs 
retrieved is ~1.4 million. So, the high redshift sample in 
the current work (Cut II) actually represents ~ 64% of the 
MegaZ-LRG catalog. That is due to the different criteria 
employed here. I allow only objects at 1A < g — r ^ 3 and 
r-iK: 1.5, while ICollister et all (|2007f ) uses 0.5 < g — r ^ 3 
and r — i < 2. Besides that, they select galaxies at d± > 0.5, 
while here the adopted cut is d± > 0.55. When imposing 
that galaxies should have spectroscopic measured redshifts 
and applying the same criteria as above, there are 197,956 
luminous red galaxies in SDSS. Out of these, 186,572 are at 
low redshifts (Cut I) and 11,384 at high redshifts (Cut II). 
The small number of galaxies with spectra available at high 
redshift is due to the fact that 2SLAQ was restricted to a 
small number of fields located in the equatorial stripe of the 
SDSS survey area. This set is called the spectroscopic sam- 
ple. Note that this sample does not include stars, as those 
were removed according to their spectroscopic identification. 



3.1 Photometric redshifts of LRGs 

The empirical photometric redshift estimators rely on the 
existence of a training set of objects with spectroscopic red- 
shifts. This set should be representative, in terms of photom- 
etry and redshift, of the target sample which will be used 
later on. The training set used here is the spectroscopic sam- 
ple (with 197,956 objects) mentioned above. Actually, this 
sample is divided into a "training" and "evaluation" sam- 
ples. I randomly selected 10,000 objects out of the 197,956 
LRGs to be the "training" set. The remaining 187,956 galax- 
ies are kept to play the role of an "evaluation" sample. The 
training set is then used to derive an empirical relation be- 
tween galaxy colors and redshift, which is then applied to 
the the evaluation set. I found that increasing the training 
sample to 20,000 objects does not represent a meaningful 
gain in accuracy. Forcing the "random" selection to have a 
fixed percentage at z > 0.40 (say 60%) also leads to similar 
results. 

On what follows I discuss which colors are best suited 
for deriving an empirical relation used for photometric red- 
shift estimates. In Figure 1 1 show the variation with redshift 
of the apparent magnitude r and of five SDSS colors, namely 
(u-g), (g-r), (g-i), (r-i), (i-z). The data points represent the 
10,000 galaxies randomly selected for the training sample. 
On each bandpass K-corrections are obtained through the 
convolution of a SEP characteristic of early type galaxies 
(taken from IColeman. Wu. fc Weedmanl Il98cl ; CWW from 
now on) with the SDSS filters. The expected colors in differ- 
ent redshifts are the result of adding the color of an elliptical 
galaxy at zero redshift and the difference in K-corrections 
between tw o bands. The zero redshift colors are tak en from 
table 3 of l|Fukugita. Shimasaku. fc~ Ichikawalll995l ). These 
color tracks are shown by the solid lines (red in the elec- 
tronic edition) of panels (b)-(f) of Figure 1. A small offset 
was noticeable between the color tracks and the data points. 
I estimated these offsets (a factor < 0.15) and took them in 
account for the figure. In panel (d) the dotted line (green in 
the electronic edition) indicates a second order polynomial 
fit to the relation between (g-i) color and redshift. In this 
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Figure 1. The variation of apparent magnitude r and five SDSS colors with redshift. The six panels show the following parameters 
versus redshift: (a) magnitude; (b) (u-g); (c) (g-r); (d) (g-i); (e) (r-i); (f) (i-z). The solid line (red in the electronic version) on each 
panel indicates the expected color variation of early type galaxies. In panel (d) the dotted line (green in the electronic edition) indicates 
a second order polynomial fit to the relation between (g-i) color and redshift. In the same panel the dashed line (blue in the electronic 
figure) shows the result of a fourth degree polynomial. 



panel the dashed line (blue in the electronic figure) shows 
the result of a fourth degree polynomial. 

A few features are readily noticed from the inspection 
of this figure. First, (u-g) shows a large scatter and does not 
follow the color track expected for an elliptical galaxy (due 
to the lower sensitivity of the u filter, specially for LRGs). 
Second, the (i-z) color shows little variation with redshift. 
Thus, we do not expect these two colors to contribute in a 
meaningful way for a photometric redshift estimator. Third, 
the (g-r) shows the expected large variation at low redshifts 
(z < 0.35), becoming nearly flat afterwards. The (r-i) color 
shows the opposite trend, being nearly constant at z < 0.35 
and increasing fast for higher-z. The (g-i) combines the re- 
sults of the two previous colors, showing a large variation at 
low-z and a less steep dependence at higher-z. These results 
are mainly associated to the shifting of the 4000 A break 
between the g and r filters at z ~ 0.35. Last, it is worth 
to note that the (r-i) color has a much smaller scatter in 



comparison to the ones based on the g— band (u-g, g-r, g-i). 
That happens cause the reddest filters are better suited for 
sampling these types of galaxies. 

I then used the training data to estimate different em- 
pirical relations between colors and redshift. These relations 
are based on a variety of combinations of colors, with the 
use of the r band magnitude in a few cases. These rela- 
tions are then applied to the 187,956 galaxies of the eval- 
uation sample. The redshift accuracy is characterized by 
the residual between the spectroscopic and photometric red- 
shifts (S z = z S p e c — z p hot) and the standard deviation 

ff = /jTTE^-^ • < 9 > 

where 8 Z i is the residual for the i-th galaxy and n —< 8 Z > 
is the mean residual. The sum is performed over all N data 
points. I also computed the mean and standard deviation 
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Photometric redshift errors of LRGs from different relations 



Relation fi a fio co Fraction(%) 



gi4 -0.0024 0.029 -0.0026 0.025 94.1 

giri -0.0008 0.028 -0.0012 0.024 97.1 

rgrri -0.0007 0.027 -0.0008 0.023 96.4 

rgiri -0.0007 0.027 -0.0008 0.023 96.5 

grgiri -0.0075 0.027 -0.0068 0.023 97.4 

grgirirz -0.0068 0.027 -0.0062 0.024 97.1 

grgiriiz2 -0.0062 0.028 -0.0058 0.025 96.9 



TEMP-CWW 0.0016 0.030 0.0013 0.026 99.1 



Table 1. The mean, standard deviation and fraction of LRGs 
with a valid photometric redshift (2 p h t > and <5 z o| < 0.10). 
All rows show the results of polynomial fits obtained using dif- 
ferent parameters. The first row lists the results when applying a 
polynomial of fourth order to the relation between the color (g — i) 
and redshift. In the second row I list the results when using the 
colors (g — i) and (r — i). Those based on the r magnitude and 
colors (g — r) and (r — i) are show in the third row. In the fourth 
row the results represent the use of the r magnitude and colors 
(g — i) and (r — i). The fifth row has the results when using the 
colors (g — r), (g — i) and (r — i). In the sixth row I show the 
results obtained when adding the color (r — z) to the previous 
set. The seventh has the result of a second order polynomial to 
the colors {g — r), (g — i), (r — i) and (i — z). Details about the fits 
are given in the text. In the last row I show the results achieved 
when using the color track based on the elliptical template from 
IColeman. Wu. &: Weedmanl l|l980i ) for the (g — i) color. 

(now called /iq and <to) for the case where the residual is 
weighted by the factor 1 + z spec (S z0 = (z spec - z phot )/(l ± 
Zspec))- When computing /i and a (or fio and <to) I only use 
galaxies with \5 Z \ or \5 z o\ < 0.10. This gives equivalent re- 
sults to reject outliers at the 3-cr level. Including the gross 
outliers increases the standard deviation, as a is very sensi- 
tive to the presence of outliers. For instance, in the fifth row 
of Table 1 (results based in the colors g-r, g-i and r-i) the 
values of a and cto are raised from 0.027 and 0.023 to 0.032 
and 0.026, respectively. 

The mean and standard deviation, as well the frac- 
tion of galaxies with a valid z p hot (the number of gross 
outliers is 100 minus this fraction), obtained for different 
empirical relations are summarized in Table 1. The last 
row of this table has the results obtained when consider- 
ing the color track based on the elliptical template from 
IColeman. Wu. fc Weedmanl fl980) for the (g — i) color. In 
other words, I simply use the track exhibited as a solid line 
in panel (d) of Figure 1 to compute redshifts from the ob- 
served (g-i) color. The remaining rows (1-7) in Table 1 show 
the results for the following polynomial fits: 



Zphot 


= A + B(g - i) + C(g - if + D(g - i] 


) 3 + E{g - 


%p hot 


= A + B(g-i)+C(r-i), 




%p hot 


— A + Br + C(g - r) + D(r - i), 




Zphot 


— A + Br + C(g + D(r - i), 




Zphot 


= A + B(g-r) + C{g - i) + D(r - i). 




Zphot 


= A + B(g-r) + C(g - i) + D{r - i) 


+ E(r - z] 


Zphot 


= A + B(g-r) + C(g + D{r - i) 


+ E(i-z) 




+ F(g - rf + G(g - if + H(r - if H 


-I(i-zf 



From the inspection of Table 1 we see no large dif- 



ferences among the different relations. It is interesting to 
note that the results based on the (g — i) color track (using 
the SED from CWW) show the highest fraction of galax- 
ies with a valid photometric redshift. However, the scatter 
determined in this way is a little larger when compared to 
other solutions. Besides that fact, the inspection of the re- 
lation z p hot x z apec shows a poor correlation for z 3pec > 0.30 
(for the TEMP-CWW results). We also note that the results 
based on the r band magnitude and colors have more out- 
liers than the solutions based on two or more colors (but no 
magnitude). The result based on the r magnitude and colors 
(g — r) and (r — i) is at the same level of the one that uses 
(g — i), instead of (g — r). So, for LRGs, the (g — i) does not 
seem to be superior to the (g — r) for redshift estimation. 
That is not true for galaxy clusters though (§4). I have also 
tried second and fourth order polynomial fits to the (g — i) 
color only. However, the overall results were not better. In 
Table 1 we can see that the fraction of outliers increases for 
this (g — i) fourth degree solution. Finally, it is interesting 
to see that the use of the z filter trough the r — z or i — z 
colors does not help improving the results. The scatter is at 
the same level when adding the r — z color, increasing a little 
if the i — z is employed with a second order fit. 

I also estimated the uncertainty in the photometric red- 
shift estimates, of each galaxy, through propagation of er- 
rors. In this process I consider the error in the coefficients 
of the empirical fits, as well as the photometric errors for 
each magnitude or color. These uncertainties are estimated 
as shown in the equation below, where I use the fractional 
uncertainties in the coefficients (such as A, B, ...) and in the 
color and magnitudes used on each fit (r,g — r,g — i, ...) 

A ZphDt = ^(^) 2 + ... + (^) 2 + ...'. (10) 

After inspecting Table 1 and plots of the z p hot x z ap£C 
relation I decided to consider as the final catalog of photo- 
metric redshifts the one based on the (g—r), (g—i) and (r — i) 
colors. The coefficients for the empirical relation derived for 
this case are 

A = -0.3068 ± 0.0006 
B= 6.2005 ±0.1333 
C = -5.9933 ±0.1331 
D= 6.4932 ± 0.1324 

This decision was mainly motivated by the fact that this 
relation is the one to produce fewer outliers and to show 
small individual galaxy redshift errors. The results based on 
relations that involve the magnitude r have large galaxy 
error estimates, due to the large fractional uncertainties 
in the magnitude coefficient That is the main rea- 

son for not adopting one of these relations. A plot with 
the comparison between z p hot and z spe c obtained with the 
(g — r), (g — i) and (r — i) colors is shown on Figure 2. 
Here I plot 15,000 randomly selected points from the eval- 
uation sample (comprising 187,956 galaxies). It is impor- 
tant to mention that these results are i n good agreement to 
what has been found by ot her authors (jPadmanabhan et al.l 
120051 : ICollister et al.l I2007T ) and the residuals show no sys- 
tematic trends with z 3pec . When considering only galaxies 
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Figure 2. Comparison between Zp^ot and z spec using the empir- 
ical relation based on the (g — r), (g — i) and (r — i) colors (upper 
panel). In the lower panel the residuals <5 z o = (z spec — z phot )/ (1 + 
Zspec) are shown. The solid lines (red in the electronic edition) 
indicate the Y = X result on top and the zero residual in the 
bottom panel. 



with 



< 0.55, a = 0.027 and ao = 0.023, while for 



galaxies at 0.55 < z spec ^ 0.70, a = 0.049 and a = 0.040. 

Table 2 presents the luminous red galaxy catalog 
derived from the relation based on (g — r), (g — i) and 
(r — i) colors. The parameters listed are ra, dec, u, g, 
r, i, z, u_err, g_err, r_err, i_err, z.err , z_phot , 
err_z_phot and objID (the object ID within SDSS). The 
magnitudes are the de-reddened model magnitudes. When 
the estimator led to a negative photometric redshift I set 
z_phot and err_z_phot to -9.99. That happens for 8,509 
galaxies (only 0.6% of the total catalog). 



4 GALAXY CLUSTERS 

The knowledge of clusters redshifts is essential for estimating 
other physical parameters of these systems (such as lumi- 
nosity and richness). Accurate redshifts are also crucial for 
large scale structure studies. Perhaps the oldest and sim- 
plest way to get photometric redshifts is through the use 
of single band galaxy magnitudes within the cluster region. 
However, the use of colors turned the photometric redshifts 
of clusters much more accurate in the last years. Color based 
techniques explore the fact that clusters have a large pop- 
ulation of early type galaxies which are characterized by 
a strong break at 4000 A. The observation of this feature 
through different filter shows a well defined correlation with 
redshift. 

Some cluster selection methods explore this character- 
istic, estimating a cluster redshift at the tim e of detec- 
tion ijGladders fc Yeeil2005l ; iKoester et aHl2007l) . while oth- 



ers d o the estimation independently of the selection proce- 
dure (|Gal et al.ll2003l ) . Typical accuracy of these color based 

methods is < 10%. 

The catalogs from lGoto et alj (|2002T ) and lKoester et all 

are based on SDSS. So, their results provide an opti- 
mal basis for comparison to what I find here. Two of the 
goals aimed in this work are: the derivation of redshifts 
independent of the selection procedure; and the extension 
of these estimates to z ~ 0.55, when using SDSS data to 
r = 21. The first point is motivated by the fact that some 
methods for cluster selection (such as the matched filter or 
the maxBCG technique) provide a redshift estimate as the 
output of the selection procedure. Although that is a great 
advantage, it also makes it difficult to use these methods 
for redshift estimation of clusters that do not properly fit 
the required properties imposed by these techniques. The 
photo-z technique presented here is applicable to clusters 
detected by different methods. I also noticed that none of 
the SDSS cluster catalogs available in the l iterature have 
redshi ft estimates to z ~ 0.55. The catalog of IKoester et al.l 
(|2007l ) focus the regime at 0.1 < z < 0.3 (some other cat- 
alogs in SD SS also do not go deepe r than that ). How ever, 
the work of iKim et all (|2002h and iGoto et al.l (|2002l ) use 
galaxies at r ^ 21 and r ^ 21.5, respectively; and these cat- 
alogs are not intentionally d riven to a low red shift regime. 
The redshift estimates from IKim et alj (|2002l ). based on a 
matched filter (MF) technique were intentionally truncated 
at z = 0.5 (as they ran the M F up to this redshift). The 
results from Goto et "all (|2002l ) use the g — r color track for 
redshift estimation, but as seen from their figure 14, their 
photo-zs are truncated at z ~ 0.44, while the spectroscopic 
sample used for comparison goes to z — 0.5 and they prob- 
ably detect higher redshift systems (considering the magni- 
tude limit adopted). In other words, although accurate they 
underestimate the redshifts (at least in the high-z regime). 



4.1 Selection of a Training Sample of Galaxy 
Clusters 

For the determination of photometric redshifts of galaxy 
clusters the first step that should be taken is the com- 
pilation of a list of objects with measured spectroscopic 
redshifts. Unfortunately, there are not so many clusters 
with spectra taken at z > 0.3, which biases our sam- 
ple to low-z clusters. Our calibration sample consists 
of 512 clusters ov er the area cove r ed by SDSS (DR5). 
These come from IStruble fc Rood! (Il999i); iHolden et all 
ill999l); Ivikhlinin et all (|l998h : ICarlberg et alj (|l996l ) arid 
iMullis et all ([2003) . The combined sample of these refer- 
ences contains 1805 clusters in the whole sky. After select- 
ing all clusters with redshifts at 0.02 z ^ 0.55 and outside 
70.0 < a < 110.0 or 270.0 < a < 300.0 we are left with a 
list of 1055 clusters. The right ascension limits are meant 
to avoid most systems outside the SDSS region. However, 
many systems that do not overlap with SDSS are still al- 
lowed in this list. Then I select data from SDSS for all these 
clusters. Those falling off the SDSS limits will obviously con- 
tain no galaxies. For the remaining I generate finding charts, 
which are inspected to see if the regions around each cluster 
(8.0 Mpc x 8.0 Mpc) are well sampled (I exclude clusters 
with excised regions near their centers). The final list com- 
prises 512 systems. The sample size may not be large enough 
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Example of the luminous red galaxy catalog 

ra dec u g r i z u-crr g-err r-err i-crr z-crr z_phot err-zp objID 

0.001571 14.982669 26.399 22.074 20.319 19.488 19.079 0.667 0.159 0.050 0.039 0.126 0.47247 0.06714 587730774425600743 

0.002045 -9.793661 22.227 21.282 19.469 18.829 18.490 0.609 0.083 0.025 0.023 0.073 0.38900 0.03420 587727179523227851 



Table 2. Example table of the LRG catalog containing 1,459,536 objects. The magnitudes listed are the de-reddened model magnitudes. 
The full table is available on-line or upon request to the author. 



to properly include evolutionary effects. However, the dom- 
inant population of red galaxies in clusters is believed to 
evolve passively with redshift. So, for clusters, the average 
color estimated from these galaxies should provide a clear 
correlation with redshift. In the future it will be interest- 
ing to use larger training samples of clusters for empirical 
photometric redshift estimators. 

The next step is to investigate which colors provide the 
best connection to spectroscopic redshifts. When doing that 
I noticed that the relations between magnitude or colors to 
redshift are very well established. However, caution should 
be taken for the background correction (§4.2) and also to 
the selection of clusters which will be used as the evalua- 
tion sample. I noticed that most of the 512 clusters can be 
well represented by the color tracks of elliptical galaxies. 
However, there are a few outliers, specially at low redshifts, 
which should definitely be avoided when training a photo-z 
estimator. These are mainly associated to wrong redshifts 
(clusters with few galaxies with available spectra) or projec- 
tion effects (if there is one or more clusters aligned to a low 
redshift system the color inferred will probably be wrong). 

I then decided to gather further information from NED 
to exclude clusters with a small number of galaxies with 
redshift available. When doing that I select from NED all 
galaxies within 3 arcmins of each cluster center. After in- 
specting the information retrieved for each cluster I kept 
only those with at least 3 galaxies with a concordant red- 
shift. Besides that, NED also provides a "special note" for 
some clusters, meaning that there is some peculiarity with 
the object (most of times it is a double system or there are 
differ ent redshifts listed for i t). Most of the 512 clusters are 
from IStruble fc Rood! (|l999h who also gives the number of 
galaxies used for measuring the redshift (may be different 
from above, as the aperture is not 3'). I then impose that 
the clusters should not h ave the "special note" in NED and 
have at least 3 galaxies in lStruble fc Ro od (1999). The final 
training set comprises 132 systems. Nearly all high redshift 
clusters are kept (z > 0.4). 



4.2 Magnitudes, colors and background correction 

In this section I show how I compute mean magnitudes and 
median colors and apply a background correction along this 
process. Figure 3 shows the color tracks for elliptical (E) 
galaxies (upper panel) and late type (Sbc) galaxies (bottom 
panel) obtained from the convolution of the CWW templates 
with the SDSS filters. The colors shown are (it — g), (u — r), 
(g — r), (r — i), (i — z). Other colors are not exhibited to avoid 
confusion. The analysis shown here is complimentary to the 
discussion done for the LRGs (§3.1). From the top panel it is 
easy to conclude that the (i — z) color provides no meaningful 



information for redshift estimation, while (r — i) is expected 
to be a powerful redshift discriminator at 0.40 < z < 0.60 
and (g — r) works fine at z < 0.40. The two colors based on 
the u magnitude (u — g and u — r) show a steep variation 
with redshift at z < 0.40 (with u — g being flat at z < 0.20). 
However, these two colors have a large scatter (§3.1) due to 
the lower signal to noise of the it band, specially for early- 
type galaxies, rendering their use for redshift estimation not 
relevant. But that does not prevent us from using the u — r 
to separate early and late type galaxies (see below). 

All mentioned so far is in line with the discussion in 
section 3.1 (about Figure 1). Perhaps the most important 
information that could be extracted from Figure 3 is a way 
to discriminate early and late type galaxies. We see that the 
colors (g — r), (r — i), (i — z) are not good for that purpose as 
the tracks of Es and Sbcs are too close and even overlap in 
some redshift ranges. The other two colors (it — g and u — r) 
seem to be useful for this separation. However, we can note 
that at low redshifts (z ~ 0.15) the difference between Es 
and Sbcs is approximately 0.50 mag only for the u — g color, 
while something similar to that does not happen for the u — r 
color at a fixed redshift. But it is important to mention that 
the difference between the lowest value of the u — r color 
for ellipticals (at 2 = 0) and the highest for late types (at 
2 ~ 0.32) is of 0.40 mag. So, when using the u — r color a high 
redshift blue galaxy could be mistaken for a low redshift red 
galaxy. Even considering that and the already mentioned 
large scatter of these colors I find that they are helpful for a 
rough discrimination between galaxy types, which is enough 
for the purpose of this work. I adopted (u — r) as it shows 
better results and for being well know f or having a bimod al 
color distribution (|Strateva et al.ll200ll ; [Driver et al.ll2006l ). 

After this discussion we are now ready to compute mean 
magnitudes (r— band) and median colors (g — i and r — i, 
for instance) for all the 132 clusters of the training sam- 
ple. For each cluster the number of galaxies as a function 
of magnitude (N r ) and colors (N( 3 _ i ) ( r _ i )) is determined 
(within and aperture of 0.50 h -1 Mpc, or equivalently 0.71 
Mpc for h = 0.7). I use bins of 0.10 mag for generating his- 
tograms of these counts. Blank fields are used to estimate the 
background counts. Fifty random regions (with 0.5° radius) 
are selected in the sky. When computing the background 
counts I exclude boxes with values outside the boundary de- 
termined by the mean ± 3<r of the 50 original boxes. For 
each magnitude or color bin the final background value is 
the mean of counts from the valid background boxes. Then 
I generate the background magnitude and color distribu- 
tions in the same way as done for each cluster, but scaling 
the counts for the cluster area. All clusters have their magni- 
tude and color distributions corrected from the background 
histograms, leading to the net cluster histograms. These are 
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Colors vs redshift (E) 

u-g 

u-r 




H 1 1 1 1 1 1 1 1 1 1 1 1 r 



Colors vs redshlft (Sbc) 

u-g 

u-r 
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0.2 0.4 0.6 



Figure 3. Color variations as a function of redshift. The top 
panels show color tracks for an elliptical SED, while the bottom 
panel is for a late type SED (Sbc). Both SEDs are from CWW. 
The colors exhibited are: (u — g) with solid line; (u — r) as dotted 
line; (g — r) as short dashed; (r — i) with the long dashed line and 
(i — z) as the dot-dashed line 

used to compute the mean magnitude and median colors of 
the clusters. 

I tested three different possibilities for obtaining these 
corrected cluster counts. The procedure is executed exactly 
as described above, but in the first case I consider all galax- 
ies with the r band magnitude less than 21 (m r ^ 21, which 
is approximately the star/galaxy separation limit of SDSS) 
and in the second I count only galaxies with m r ^ 21, 
but with m* — 3 ^ m r ^ m* + 3, where m* is the ap- 
parent characteristic magnitude of clusters. I consider the 
bright end values of the doubl e-Schechter cluster lu minos- 
ity function (LF) obtained by IPopesso et al.l l|2006l ). They 
found a = -1.09 and M* = -20.94 within 7? 2 oo. This value 
of M* is converted to the same cosmology used here and 
to the proper value at z — (taking the mean redshift 
of their sample as z = 0.1). Then, for different redshifts I 
adopted an evolutionary correction to th e value of M*, given 
byM '(z) = M*(0) + Qz, with Q = -1.4 (|Yee fc Lopez-Cruj 
1 19991 ) . The absolute characteristic magnitude (M*) is con- 
verted to m* through the application of the distance mod- 
ulus formula to each cluster redshift (the variation of m* 
with redshift is shown in the upper panels of Figure 4). The 
third test done is to compare the values obtained when con- 
sidering all galaxies to the results found when imposing a 
selection according to the (u — r) color. 

In Figure 4 I show the dependence of mean apparent 
magnitude r mean (two upper panels) and two SDSS colors 
(median values; four lower panels) with redshift. The colors 
exhibited are g-i and r-i. In the top panels the solid line 
represents the expected variation of the apparent character- 
istic magnitude of clusters, while in the four lower panels the 



solid line indicates the expected color variation of E galax- 
ies. In all panels, every black dot is a cluster (the 132 of the 
training sample), while the triangles with error bars (red in 
the electronic edition) indicate the mean values in redshift 
bins of 0.05. The left panels (a, b and c) show the results for 
the case where counts are taken at m* — 3 ^ m r ^ m* + 3 
and a color cut (in u-r) is applied for selecting galaxies. 
The exact values for this cut are chosen after comparing the 
u-r color tracks of Es and Sbcs in figure 3. Initially, I se- 
lected only galaxies with u — r ^ 2.40, which already led 
to very good results. However, I found that these could still 
be improved if a variable cut (with redshift) was applied. 
The choice of this cut affects most the higher redshift clus- 
ters (z > 0.20). I finally decided to select only the galaxies 
with u - r > 2.00 for clusters at z ^ 0.20, u - r > 2.30 
at 0.20 < z < 0.40 and u - r > 2.45 at z > 0.40. I found 
these cuts to gi ve the most acc u rate p hotometric redshifts. 
For comparison IStrateva et al.l (|200ll ) find that early and 
late types can be well separated by a simple color cut at 
u-r = 2.22. 

In the right panels of figure 4 I show the results of not 
applying one of the constraints mentioned above (the fixed 
luminosity range m* — 3 ^ m r ^ m* + 3; or the color cut 
in u-r) . Panel (d) shows the results for r me an when using all 
galaxies at r ^ 21 (counts are not restricted to a fixed lumi- 
nosity range, but the u-r color cut is still enforced) . Finally, 
panels (e) and (f) have the median color variations when I 
do not impose a color cut in u-r (but the fixed luminosity 
range is still applied). 

It is worth mentioning that the further constraints ap- 
plied above (in luminosity and color) are intended to im- 
prove the background correction, which is done by the sub- 
traction of every cluster histogram (in magnitude and colors) 
by the background distribution. From panel (a) we can see 
that the mean cluster magnitudes show a strong variation up 
to redshift ~ 0.4. After that the relation tends to become 
flatter. That is due to the magnitude limit considered for 
the survey (r = 21.0), which renders the cluster counts at 
m* — 3 ^ m r ^ m* + 3 truncated for high redshift systems 
(as m* + 3 extrapolates r = 21.0). Applying a correction 
to the r mean values, to take in account this truncation, re- 
sults in no meaningful improvement in the accuracy of the 
photometric redshifts. When comparing panels (a) and (d) 
there is a remarkable difference between computing r mean 
within a fixed luminosity range (m* — 3 ^ m r m*+3, 
for instance) or using the full survey limits (I considered all 
galaxies with r ^ 21 for panel d). The main effect is the 
overestimation of the counts at low redshifts (z < 0.15), 
which leads to the flattening of the r mean x z relation in 
this regime. That happens because when using all galaxies 
at r ^ 21 for low-z clusters, we sample magnitudes that are 
too faint in comparison to the relevant regime of a cluster 
LF (such as m* — 3 ^ m r ^ m* + 3). Then, the r mean values 
become biased towards higher values, with also a noticeable 
increase in the scatter. For clusters at z > 0.15 there is no 
visible difference because m* + 3 is always close to r = 21 
(or the survey limit is even extrapolated for high-z systems). 
A similar discussion, but for richness co mputation (instead 
of r mean ) is done in lLopes et alj (l2006t ). On what regards 
colors, the use of all galaxies at r ^ 21 has no large impact. 
We only see very few clusters that have their median colors 
offset from the color tracks, increasing a little the scatter. 
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Figure 4. The variation of mean apparent magnitude r m ean and two SDSS colors (median values) with redshift. The top two panels 
show the r mean variation, while the lower four panels exhibit the median color variations (g-i in the two middle panels and r-i in the 
lower two). The left panels (a, b and c) have the results when considering counts at m* — 3 ^ m r $C m* + 3 and with a color cut 
(u-r) applied for galaxy selection (this cut is explained in the text). Panel (d) shows the results for the mean magnitude when using all 
galaxies at r ^ 21 (counts are not restricted to a fixed luminosity range, but the u-r color cut is still applied). Finally, panels (e) and 
(f) have the median color variations when I do not impose a color cut in u-r (but the fixed luminosity range, m* — 3 $C m r ^ m* + 3, 
is still enforced). The solid line on each panel indicates the expected color variation of early type galaxies (except for panels (a) and (d) 
where they indicate the expected variation of the apparent characteristic magnitude of clusters). Each black dot represents one of the 
132 clusters of the training sample, while the triangles (red in the electronic edition) with error bars indicate the mean values in redshift 
bins of 0.05. 



From the comparison of panels (b) and (e) we notice 
a clear trend for underestimation of the g-i color at z > 
0.15. The effect is more pronounced at z > 0.40. The same 
effect is noticed for the g-r color (not shown in the plot). 
For r-i the effect is not too drastic and we only see a mild 
underestimation at z > 0.40. Nonetheless, these results are 
very useful to show the relevance of imposing a color cut 
(according to u-r) for measuring cluster colors. By doing so, 
we can restrict the analysis to early type galaxies, rendering 
the derived colors in good agreement to the expectations of 
elliptical galaxies. 



the mean magnitudes and median colors of clusters, esti- 
mated as in the left panels of figure 4. In other words, on 
top of the background corrections I require galaxies to have 
m* — 3 ^ m r ^ m*+3 and also impose a color cut (in u-r), as 
described above. For the 132 clusters of the training sample 
I derived the values of r mea „, (g — i) me dian and (r — i) me di an , 
which are shown in figure 4. An empirical relation between 
these three parameters and redshift is then derived. Other 
colors are also obtained and will later be used for compari- 
son to the results based on the two above. This relation can 
be expressed by 



4.3 Photometric redshifts of galaxy clusters 



I then proceed to derive empirical relations to estimate pho- 
tometric redshifts of clusters. That is done in a similar way 
to what is shown in section 3.1 for LRGs, but here I use 



Zphot ~ A + B 

1 mean 1 Jmedian v ) median ; 

(11) 
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analogous to the ones employed for LRGs. The derived co- 
efficients are 

A = -0.4424 ± 0.0084 
B = 0.0076 ± 0.0006 
C = 0.2382 ± 0.0024 
D = 0.2126 ± 0.0042 

To assess the accuracy of the photo-z estimator this 
relation is applied to all 512 clusters with known spectro- 
scopic redshifts (section 4.1). However, as we would do with 
clusters with unknown redshifts, we have to start the pro- 
cedure with a guess redshift and iterate it until the pho- 
tometric redshift difference between two iterations is less 
than 0.01. A maximum of 10 iterations is allowed. Conver- 
gence is not found for only 2 of the 512 clusters. This itera- 
tive procedure is necessary as we sample 0.5 h^ 1 Mpc and 
m* — 3 ^ m r ^ m* + 3 for each cluster, but we do not know 
what the redshift of the cluster is (used to determine this 
radius and luminosity range for galaxy selection). So, we 
start with a guess value (z guess = 0.15), compute the mean 
magnitude and median colors and apply the empirical rela- 
tion obtained above. Then we use the new redshift estimate 
to repeat the procedure until convergence is achieved. The 
Zphot x z spec comparison is shown on the top panel of Fig- 
ure 5, while the weighted residuals is exhibited in the lower 
panel. Clusters with |<5 z o| < 0.10 (see definition in §3) are 
shown as filled circles (489 or 96% of the 512). The remain- 
ing have open circles. If I had used more than 132 clusters 
(for instance, all the 512) to derive the photometric redshift 
empirical relation the final results would have a slightly in- 
creased scatter. That is due to the fact that some clusters 
have spectroscopic redshifts derived from a small number of 
galaxies. That is why it is important to use a clean sample 
to train the calibration (§4.1). 

In Table 3 I summarize the results obtained for relations 
based on different combinations of mean r magnitude and 
median colors. Analogously to what was done for LRGs, I 
only use clusters with \5 Z \ or \6„o\ < 0.10 when computing 
H and a (or /io and er ). Had I included the gross outliers, 
the standard deviation for the rgiri relation (third row of 
Table 3) is raised from 0.026 and 0.023 to 0.045 and 0.039 
(values of a and cr , respectively). Note that the fraction of 
outliers is only ~ 5%. See also the next section for some 
considerations regarding accuracy. 

The results in this table are first shown for all clusters 
and then for those at z < 0.30 and z > 0.30. On each case, 
the first four rows list the results considering galaxies at 
m* — 3 ^ m r ^ m* + 3 and with the u-r color cut applied. 
The fifth row exhibits the values when not imposing a fixed 
luminosity range, while the last row has the results without 
a color cut. When considering all clusters (but only the first 
four rows), we see no large differences, except for the val- 
ues achieved for the rgrri relation (r mean , (g-r) me dian and 
(r-i) me dian) which have a (or ao) a little higher than the 
rest. The same is also true for low redshift clusters (which 
dominate the sample) and the high redshift ones. Actually, 
the fraction of clusters at high-z with a valid redshift de- 
creases a little for the girt and grgiri cases. Considering 
that, I decided to adopt the redshifts obtained with r mean , 
(g-i)median and (r-i) medio „, as they have fewer outliers and 
low dispersions at all redshifts. The coefficients for equation 



Photometric redshift errors of clusters from different relations 



Relation 






Mo 




Fraction(%) 


gin 


-0.0021 


0.026 


-0.0015 


0.023 


95.1 


rgrri 


-0.0056 


0.028 


-0.0052 


0.025 


95.3 


rgiri 


-0.0046 


0.026 


-0.0040 


0.023 


94.9 




-0.0023 


0.026 


-0.0021 


0.023 


95.1 


rgiri (r 21) 


-0.0021 


0.028 


-0.0021 


0.025 


95.1 


rgiri (no u-r cut) 


-0.0021 


0.029 


-0.0018 


0.026 


93.9 




Clusters with 


z < 0.30 






gin 


-0.0020 


0.024 


-0.0019 


0.022 


95.3 


rgrri 


-0.0069 


0.025 


-0.0065 


0.024 


95.3 


rgiri 


-0.0050 


0.024 


-0.0048 


0.022 


94.9 


grgiri 


-0.0023 


0.025 


-0.0025 


0.023 


95.3 


rgiri (r $J 21) 


-0.0025 


0.026 


-0.0029 


0.024 


95.4 


rgiri (no u-r cut) 


-0.0024 


0.026 


-0.0024 


0.024 


96.1 




Clust 


ers with 


z > 0.30 






giri 


-0.0023 


0.037 


0.0016 


0.030 


92.9 


rgrri 


0.0051 


0.041 


0.0052 


0.031 


94.6 


rgiri 


-0.0007 


0.037 


0.0026 


0.030 


94.6 


grgiri 


-0.0022 


0.036 


0.0016 


0.030 


92.9 


rgiri (r $J 21) 


0.0013 


0.042 


0.0042 


0.033 


92.9 


rgiri (no u-r cut) 


0.0010 


0.048 


0.0045 


0.041 


76.8 



Table 3. The mean, standard deviation and fraction of clusters 
with a valid photometric redshift (z p f lot > and \S z o\ < 0.10). All 
rows show the results of polynomial fits obtained using different 
parameters. In the first row I list the results when using the colors 
(g — i) and (r — i). Those based on the r magnitude and colors 
(g — r) and (r — i) are show in the second row. In the third row 
the results represent the use of the r magnitude and colors (g — i) 
and (r — i). The fourth row has the results when using the colors 
(g — r), (g — i) and (i — i). Fifth row exhibits the results obtained 
with colors r mean , (g-i) med i a „ and (r-i) media „, but not being 
restricted to galaxies at m* — 3 ^ m r ^ m* + 3. All galaxies at 
r ^ 21 are used in this case. In the sixth row the results are again 
for r magnitude and colors (g — i) and (r — i), but the galaxies 
selected have no u-r cut applied. In the continuation of the table, 
the same type of information is also given for low (z ^ 0.30) and 
high redshift clusters (z > 0.30). 

11 are listed above. These results show the importance of 
using r mean and (g-i) me dmn for photometric redshift esti- 
mation at z < 0.55 in SDSS. The use of (g-r) me dian leads 
to an increased scatter (mainly at high-z), while using only 
colors (without r mean ) gives lower completeness at high-z. 

On what regards the results without the luminosity or 
color constraints (rows 5 and 6), they are noticeably worst 
than when these constraints are enforced. When not apply- 
ing the luminosity restriction the results are at a similar 
level than what is obtained in the rgrri case (except for 
the number of outliers at high-z). Similar values are also de- 
rived for the case where no color cut is applied, but only 
when considering low-z clusters. At z > 0.30 the standard 
deviation and fraction of outliers achieved without the color 
cut is the highest among all. That tells us how important 
is the selection of early type galaxies for cluster photo-z es- 
timation at higher redshifts. Without this pre-selection the 
cluster regions will likely be contaminated by lower redshift 
sources or, more importantly, by blue galaxies (which have 
an increased fraction at higher redshifts). Thus increasing 
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Figure 5. Comparison between Zp^ot and z spec using the empir- 
ical relation based on the r m ean, (g ~ i)median and (r - i) med ian 
colors (upper panel). In the lower panel the residual (<5 z q = 
(zspec — Zphot)/ (1 + Zspec)) is shown. The solid lines indicate the 
Y = X result on top and the zero residual in the bottom panel. 



the final error and biasing the results to low values. In other 
words, a simple background correction is not enough, even 
considering that we are working with a small aperture (0.50 
ft -1 Mpc). 



4.4 Considerations about photometric redshift 
accuracy 

It is important to mention that the redshift accuracy 
achieved in this work is lower than what other authors found 
with SPSS data. I estimated a = 0.024 for z < 0.30, while 
iGoto et alJ (I2002TI had estimated uncertainties of a = 0.015 
and IKoester et all ||2007|) of a = 0.01 at 0.10 < z < 0.30. 
Note that Goto et aL I l|2002l ) show residuals for clusters at 
2 > 0.08. The inclusion of lower redshift clusters in our sam- 
ple (z > 0.05) helps increasing the scatter, but by no means 
can explain the differenc e to other results. 

IKoester et all i|2007l ) estimate redshifts as part of the se- 
lection of clusters. Each cluster has assigned to it the redshift 
of a galaxy (brightest cluster galaxy, BCG) that maximizes 
the lik elihood of representing a cluster center. IGoto et alJ 
(2002) have the redshift estimates done after cluster detec- 
tion (with the cut and enhance method) . Their estimates are 
based on an early version of the maxBCG technique , and 
are not identical to the ones from IKoester et all (120071 ). For 
a given redshift, they start counting the number of galaxies 
within the cluster detection radius brighter than M r =-20.25 
and within ±1 mag in q — r around the color prediction fo r 



elliptical galaxies IjFukugita. Shimasaku. fc Ichikawalll995h . 
The procedure is repeated for several redshifts in steps of 
5z— 0.01. After the background is taken in account on each 



bin, the redshift of the bin with the largest number of galax- 
ies is considered the cluster estimated redshift. 

In this work redshifts are obtained through the appli- 
cation of an empirical relation to r mean , (g-i) median and 
(r-i) median- The process is started with a guess redshift 
{z gU ess = 0.15) and is iterated until convergence is achieved. 
For the 512 clusters used here the photometric redshift ac- 
curacy is simply given by the comparison of the measured 
spectroscopic redshift and the photometric estimate. So, it 
is clear that this proce s s is no t guided whatsoever. For com- 
parison, IKoester et alJ (|2007t ) estimate the accuracy of their 
estimates by selecting all clusters from their catalog that 
have spectroscopic redshifts for their BCGs. For those, they 
compare Zphot and z 3p ec, where z p hot is the cluster photo-z 
estimated from the maxBCG algorithm and z 3p e C is the spec- 
troscopic redshift measured for the BCG galaxy in question. 
That seems a fair comparison, but the authors also recognize 
that ~ 16% of their clusters suffer from projection effects, 
which could affect cluster redshift estimates based on sev- 
eral galaxie s (and not only the BCG). On the other hand, 
I Goto et al l l|2002l ) seem to guide the comparison of z p hot 
and z S pec- In their words: "the redshift of the SDSS spectro- 
scopic galaxy within the detected radius and with nearest 
spectroscopic redshift to the estimation is adopted as the 
real redshift" . Such procedure obviously biases the compar- 
ison to small redshift offsets, based on a single galaxy. 

I decide to esti mate the resid u als in a sim ilar way to 
what was done by IKoester et alJ (|2007t ) and IGoto et al.l 
(2002). For that purpose I did not consider the 'Main' flux- 
limited sample of SDSS. Instead, I used only the 197, 956 
LRGs, with spectra available, selected in §3. In the first case 
I selected the nearest galaxy to the cluster center (within a 
maximum aperture of 60"). If the LRG selected is close to 
the center it might be the BCG of the cluster. However, it is 
important to note that I do not make any magnitude or color 
requirement for that selection. This simple approach results 
in a = 0.021 (or <x = 0.017) for the full redshift range of 
the clusters used here. When I restrict the sample to clusters 
at 0.10 < 2 < 0.30 I fin d a = 0.014 ( o r op = 0.011). The 
last results are closer to IKoester et alJ (|2007h for the same 
redshift interval. For the full sample there are 111 (~ 23%) 
clusters (out of 489 with a valid photo-z), while there are 71 
at 0.10 < 2 < 0.30. 

To perform a comparison to IGoto et alJ ([2002) I did 
something similar to what they did. The only difference is 
to use a radius of 0.50 /i -1 Mpc, instead of the "detection" 
radius available within their catalog. That should not result 
in meaningful differences as you do not want to select a 
galaxy that is too far from the cluster center. So, within 
0.50 h -1 Mpc I select the LRG with the closest spectroscopic 
redshift to the value of Zphot for each cluster. For 301 clusters 
(~ 62%) there is at least one LRG inside 0.50 h" 1 Mpc. Out 
of those, 206 are at 0.10 < z < 0.30. I found a = 0.018 (or 
cr = 0.017) for the full sample and a = 0.017 (or a = 0.015) 
at 0.10 < 2 sC 0.30. 

One problem with these tests is the fact that correlat- 
ing the value of z p hot for a cluster with z 3p ec for a single 
galaxy may lead to wrong matches due to projection ef- 
fects. As these clusters represent a combination of objects 
from different catalogs in the literature they are not sup- 
posed to have their centroid perfectly matched with a BCG 
(substructure can affect the centroid determination). That 
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is not the case of the catalog from lKoester et ah! l|2007l ). So, 
if their BCG has an spectroscopic observation and their code 
works properly, they will have a good correlation between 
Zphot an d z spec (except, perhaps, for clusters with strong 
projection effects). Nonetheless, it is encouraging that the 
two tests above result in an improved accuracy. 

To minimize the influence of projection effects 1 also es- 
timated the accuracy in a third way. For each cluster I select 
all LRGs within 360" from the cluster center. From those, 
I check if the galaxy closest to the centroid is at a maxi- 
mum distance of 120". If it is, I assume the redshift of this 
LRG as a reference (z re f). Then, from all the other galaxies 
(LRGs) selected within 360" I take those that have a max- 
imum redshift difference \zi rg — Zref\ of 0.030. If I end up 
with at least three galaxies I take the mean of these redshifts 
to be the value of z apec . When comparing these to z p hot I 
find a = 0.016 (or <to = 0.018) for the full redshift range and 
a = 0.011 (or cr = 0.010) at 0.10 < z < 0.30. There are 125 
clusters (~ 26%) in the full sample and 81 in the restricted 
redshift interval. Note that this process is not guided what- 
soever. I use an aperture that does not scale with redshift 
(360") and check if there are at least three LRGs at the same 
redshift, taking as reference the redshift of the LRG closest 
to the cluster center (within 120"). As I impose a minimum 
number of three galaxies, projection effects are minimized 
and we can see that the acc uracy is greatly impr oved, reach- 
ing the 0.01 level found bv lKoester et alj <|2007h . 



4.5 Application of the photometric redshift 
estimator 

As an application of the empirical photometric redshift rela- 
tion obtained in §4.3 I used SDSS data to derive new redshift 
estimates for the supplemental ver sion of the Northe rn Sky 
Optical Cluster Survey (NoSOCS, lLopes et al-lliool ). This 
cluster catalog contains candidates to z ~ 0.50, but the red- 
shift estimates were based in a simple magnitude-redshift 
relation. For that project the magnitude limit adopted was 
r = 21. Thus, due to the large photometric errors, the use 
of the g — r color was not possible with data from the Dig- 
itized Second Palomar Ob s ervato ry Sky Survey (DPOSS). 
For details see lLopes et al l (|2004T l. 

This supplemental version of NoSOCS comprises 9956 
cluster candidates over 2,700 square degrees. I have recently 
searched the data release 5 of SDSS to see which clusters 
already have SDSS data. After some catalog cleaning and 
inspection of all cluster regions from SDSS I found that 
7409 NoSOCS clusters are well imaged. I then derived new 
photometric redshifts for these systems using the relation 
obtained in this work. Actually, I got new redshifts with- 
out re-centering and also considering n ew coordinates, re - 
ferred as luminosity weighted positions (lLopes et al.l l2006). 
As expected, the new redshift distribution has most of clus- 
ters at z < 0.4 with a tail extending to z ~ 0.55. Details 
about this updated version of NoSOCS will be given in fu- 
ture work, where substructure and superposition effects will 
be investigated. For the lower redshift clusters (z < 0.1) the 
updated version will also provide measures of velocity dis- 
persion, virial radius and mass; and this cluster subset will 
be used for comparison to X-rays, as well to study scaling 
relations. 



4.6 Groups in redshift space with photometric 
estimates 

In the DR5 it is also possible to get photometric redshift 
estimates of nearly all galaxies in SDSS. Two sources of 
redshifts are provided. They are simply called photoz and 
photoz2. The fir st one is based on template fitting using 
the technique of ICsabai et al.l (|2003h . The second version 
is provided by the UChicago/Fermilab/NYU group and is 
based on neural networ ks, with implementation similar to 
ICollister fc Lahav (12004 ). For the results shown below I used 
photoz (|Csabai et alj|2003f ). 

For the 512 clusters from the training sample I selected 
photometric redshifts for all galaxies from DR5. The same 
information was also derived for a region of ~ 400 square 
degrees. The boundaries of this area are 168.4 < a < 191.6 
and 20.0 < 5 < 40.0, and were chosen to avoid regions close 
to very nearby clusters, such as Virgo and Coma. This sam- 
ple will work as a large "sky" photometric redshift survey. 

I investigated the possibility to identify groups in red- 
shift space simply using these galaxy photometric redshifts. 
I proceed as follows. First, for each cluster, I start identify- 
ing groups in redshift space. For that purpo se, I use the gap- 
techni que described in lKatgert et al.l l|l996l ) and lOlsen et al.l 
(2005), which identifies gaps in the redshift distribution that 
are larger than a given value to s eparate groups. T he gap 
size adopted is Az = 0.005(1 + z) jOlsen et alj|2005h . which 
is approximately 1,500 km/s in the restframe. I considered 
the photometric redshifts available in the training sample to 
select all galaxies within 0.50 hT 1 Mpc of the cluster center. 
When performing galaxy selection, the only requirement I 
make is that the photo-z of the galaxy should be greater than 
0.01 (to avoid very nearby structures and failures within the 
SDSS estimates). After applying the gap-technique, a num- 
ber of groups in z-space is identified for each cluster 

The next step is to assess the significance of each of 
these groups. For that I consider the area of 400 square 
degrees described above. For each group I draw 1000 sets 
of galaxies from the 400 square degrees catalog. These sets 
have the same number of galaxies as in the cluster region 
where the group was identified. The gap-technique is applied 
exactly as before and then I check the probability of finding 
groups with at least the same number of galaxies at the 
redshift of the original group. A field group is considered if 
its redshift is within ±0.005 of the group identified in the 
cluster region. The significance is given by the difference 
between one and the achieved probability. I only consider 
groups that are significant at the 99% level. 

From all the significant groups I select the one that has 
the smallest redshift difference to the spectroscopic value 
of the cluster in question. This group should also have at 
least three member galaxies (most have many more) and be 
found within 3' of the cluster center. I have also ran this 
group identification procedure with one slight modification, 
which is the gap size. That was modified to Az — 500(1 + 
exp(—(N — 6)/33))/c, where N is the number of galaxie s 
found in the redshift survey of a cluster (|Adami et al.|[l998l ). 
and c is the speed of light in km/s. 

In Figure 6 I summarize the results obtained for the two 
gap sizes adopted. In the bottom panel (b) the gap size con- 
sidered is Az = 0.005(1 + ^), being Az = 500(l + ea;p(-(iV- 
6)/33))/c in the upper panel (a). The total number of clus- 
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Galaxies Photo-zs from 
01 r Template Fitting (Csabai et al. 2003) 



gap = 500(1 + exp(-(N-6)/33)) 



(a) 




gap = 0.005(1 +z) 



(b) 



zXdspec 



Figure 6. Residual plots between spectroscopic redshifts of train- 
ing sample clusters and the closest group identified in redshift 
space using photometric redshifts from SDSS. From the significant 
groups I select the one with smallest redshift difference, at least 
three member galaxies and within 3' of the cluster center. The top 
panel consider the gap size Az = 500(1 + exp(—(N — 6)/33))/c, 
while for the bottom panel I use Az = 0.005(1 + z). Details are 
given in the text. 



ters identified and the residuals between z spec and z p hot are: 
(a) 298 and a = 0.011 (or <x = 0.009); (b) 414 and a = 0.014 
(or cy o = 0.013). 

It is interesting to see that these galaxy photometric 
redshifts provide values accurate enough to trace the local 
environment of clusters. Similar results are achieved when I 
adopted the photometric redshifts of LRGs (§3). From the 
inspection of figure 6 it is clear that the identification of 
groups at higher redshifts (z > 0.4) becomes very hard. 
That is due to the poor sampling of the luminosity function 
of clusters at this redshift interval for r ^ 21. When using 
the gap size Az — 0.005(1+2:) we identify more clusters than 
with the other gap, but at the cost of having an increased 
scatter. 

Considering the larger number of blue galaxies at high 
redshifts the red sequence in some clusters may be ill de- 
fined. So, it is important to consider alternative approaches 
to cluster detection, such as the selection based on galaxy 
photometric redshifts. 



5 CONCLUSIONS 

In this work I described the construction of a large cata- 
log of luminous red galaxies with photometric redshifts at 
z < 0.70 within SDSfl This catalog is based on a empiri- 

2 The LRG catalog can be retrieved from the electronic edition 
of this journal or by request to the author. 



cal relation to derive the photo-zs. Such relation uses three 
bands only for achieving results as accurate as those ob- 
tained by other methods ( sometimes based in m ore bands; 
iPadmanabhan et alj|200l ICollister et~ai1l2007f ). The com- 
parison of photometric and spectroscopic redshifts show no 
large systematic effects in the redshift range probed, which 
suggests that this sample is suitable for large-scale structure 
studies. 

I have also investigated what the main systematics are 
in the estimation of photometric redshifts of galaxy clusters 
at z < 0.55. That represents an improvement respective to 
some deep cluster catalogs derived from SDSS. When these 
catalogs have accurate pho tometric redshift estimates, these 
are truncated at z < 0.44 (|Goto et al.ll2002r i. When they go 
a little further (z = 0.5) the estimates are not as accurate 
jKim et al.ll2002T ). I also showed the relevance of using the 
g — i color, and to a lesser extent the mean r magnitude, 
to improve the photometric redshift accuracy, especially at 
high-z (z > 0.4). Besides that, I show that on top of a "tra- 
ditional" background correction it is very important to se- 
lect galaxies from a fixed luminosity range and perform a 
careful selection of early type galaxies. In this work, this 
pre-selection of red galaxies is made through a variable cut 
(in redshift) in the it — r color. The main advantage of this 
type of selection is to only require that clusters should ex- 
hibit a population of early type galaxies towards their cores. 
There is no need for these galaxies to exhibit a narrow red 
sequence. This type of requirement is very important for 
the cases where the photometric errors are large or the red- 
sequence is still being formed, which could be the case at 
high redshifts. 

The results obtained for clusters are independent of the 
way these are selected. So, the methodology described in 
this work should be valid for any type of clusters, selected by 
different techniques and wavelengths. The only requirement 
is to have the proper filters for separating early and late type 
galaxies and to track the 4000 A break. For SDSS data this 
method works for clusters at z < 0.55. Other considerations 
are made regarding accuracy. I show that the results shown 
here are in good agreement to previous works. 

The empirical relation derived for clusters is applied 
to 7409 clusters f rom the NoSOCS supplemental catalog 
|Lopes et alj|2004 ) which are found within SDSS. For these 
clusters I was able to update the photometric redshift es- 
timates, deriving more accurate values than before (when 
using only magnitudes within the DPOSS data). This clus- 
ter catalog, with the new redshifts, will be updated in a fu- 
ture work, where substructure and superposition effects will 
be investigated. This catalog is also being used to derive 
velocity dispersions and mass estimates for the lower red- 
shift systems (z < 0.1) and to investigate scaling relations 
in clusters. 

Finally, I tried to identify groups in redshift space using 
photometric redshifts of galaxies available in SDSS. I found 
that for ~ 60% of clusters (mostly at z < 0.4) it is possible 
to clearly identify a group using only photometric redshifts 



of galaxies. When comparing 



of clusters to the photo- 



metric redshift of the nearest group, a — 0.011. If a different 
gap size is employed when searching for the groups the rate 
of identified systems increases to ~ 80%, but the accuracy is 
a little worst (a = 0.014). This procedure represents an al- 
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ternative approach for cluster detection, based on on galaxy 
photometric redshifts. 
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